Integrating connectionist, statistical and symbolic approaches for continuous spoken Korean processing

نویسندگان

  • Gary Geunbae Lee
  • Jong-Hyeok Lee
  • Kyubong Park
  • Byung-Chang Kim
چکیده

This paper presents a multi-strategic and hybrid approach for large-scale integrated speech and natural language processing, employing connectionist, statistical and symbolic techniques. The developed spoken Korean processing engine (SKOPE) integrates connectionist TDNN-based phoneme recognition technique with statistical Viterbi-based lexical decoding and symbolic morphological/phonological analysis techniques. The modular large-scale TDNNs are organized to recognize all 41 Korean phonemes using 10 component networks combined through 3 glue networks. In performance phase, continuously shifted TDNN outputs are integrated with HMM-based Viterbi decoding using a tree-structured lexicon. The Viterbi beam search is integrated with Korean morphotactics and phonological modeling, and produces a morpheme-graph for high-level parsing module. Currently, SKOPE shows average 76.2% phoneme spotting performance for all 41 Korean phonemes (including silence) from continuous speech signals and exhibits average 92.6% morpheme spotting performance from erroneous TDNN outputs after morphological analysis. Other extensive experiments verify that the multi-strategic approaches are promising for complex integrated speech and natural language processing, and the approaches can be extended to other morphologicallycomplex agglutinative languages such as Japanese.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

SKOPE: A connectionist/symbolic architecture of spoken Korean processing

Spoken language processing requires speech and natural language integration. Moreover, spoken Korean calls for unique processing methodology due to its linguistic characteristics. This paper presents SKOPE, a connectionist/symbolic spoken Korean processing engine, which emphasizes that: 1) connectionist and symbolic techniques must be selectively applied according to their relative strength and...

متن کامل

Integrated speech and morphological processing in a connectionist continuous speech understanding for Korean

A new tightly coupled speech and natural language integration model is presented for a TDNN-based continuous possibly large vocabulary speech recognition system for Korean. Unlike popular n-best techniques developed for integrating mainly HMM-based speech recognition and natural language processing in a word level, which is obviously inadequate for morphologically complex agglutinative language...

متن کامل

SCREEN: Learning a Flat Syntactic and Semantic Spoken Language Analysis Using Artificial Neural Networks

Previous approaches of analyzing spontaneously spoken language often have been based on encoding syntactic and semantic knowledge manually and symbolically. While there has been some progress using statistical or connectionist language models, many current spoken-language systems still use a relatively brittle, hand-coded symbolic grammar or symbolic semantic component. In contrast, we describe...

متن کامل

Screen: Learning a Flat Syntactic and Semantic Spoken Language Analysis Using Artiicial Neural Networks

Previous approaches of analyzing spontaneously spoken language often have been based on encoding syntactic and semantic knowledge manually and symbolically. While there has been some progress using statistical or connectionist language models, many current spoken-language systems still use a relatively brittle, hand-coded symbolic grammar or symbolic semantic component. In contrast, we describe...

متن کامل

SCREEN : Flat Syntactic and Semantic Spoken Language

Previous approaches of analyzing spontaneously spoken language often have been based on encoding syntactic and semantic knowledge manually and symbolically. While there has been some progress using statistical or connectionist language models, many current spoken-language systems still use a relatively brittle, hand-coded symbolic grammar or symbolic semantic component. In contrast, we describe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996